Keyword Spotting on Word Lattices

Author

  • Joel Praveen Pinto
Abstract

In spite of its numerous potential applications, Automatic Speech Recognition (ASR) remains a difficult (and mainly unsolved) problem. In addition to the intrinsic difficulty of the task, users tend to go beyond the words of the pre-defined lexicon, and the keywords needed to understand a voice request are often buried in extra words. In this context, it is useful to develop Keyword Spotting (KWS) approaches that focus on detecting and recognizing pre-defined keywords embedded in unconstrained, conversational speech. The goal of this project is to perform confidence-based keyword spotting on word lattices, which are a compact way of storing the most probable hypotheses previously generated by a Large Vocabulary Continuous Speech Recognizer (LVCSR). Every spoken utterance is turned into a word lattice, and fast search and rescoring techniques are then applied to detect keywords in the lattice, using their confidence scores to make an accept/reject decision. By doing so, more knowledge (lexical and syntactic) can be taken into account in the first pass (the LVCSR pass), while the search space of the second pass is limited, which in turn allows more complex algorithms to be used there. More specifically, in the work presented here, posteriors of keyword hypotheses are estimated from the reduced search space (the lattices) and used as confidence scores to make the final decision. Since decisions based on posteriors minimize, by definition, the probability of error, we thus aim at minimizing the false alarm rate while maximizing the true detection rate. Moreover, in order to take into account the dispersion of word posterior probability mass among parallel lattice edges, various posterior rescoring techniques are investigated, all based on posterior accumulation over keyword hypotheses. Generation and rescoring of the above word lattices were all based on the popular Hidden Markov Model (HMM) approaches.
In this context, two particular instances of HMMs have been used and compared: the standard approach, which uses Gaussian Mixture Models (GMMs) to estimate local (likelihood) scores, and a second one, which uses a Multilayer Perceptron (MLP) to estimate local posteriors directly.
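
As a rough illustration of the posterior accumulation idea described above, the following minimal sketch computes edge posteriors on a toy word lattice with the forward-backward algorithm, then sums the posteriors of time-overlapping edges carrying the same keyword before thresholding. It is not the author's implementation: the `Edge` fields, the function names, the toy lattice, the threshold of 0.6, and the simple overlap-based clustering are all assumptions chosen for illustration, and the abstract does not say which accumulation schemes were actually used.

```python
import math
from collections import namedtuple

# Hypothetical lattice edge: node ids are assumed to be integers in
# topological order (src < dst); log_score combines acoustic and LM scores.
Edge = namedtuple("Edge", "src dst word t_start t_end log_score")

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a list of log-values."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))

def edge_posteriors(edges, start, end):
    """Forward-backward over the lattice; returns one posterior per edge."""
    nodes = {e.src for e in edges} | {e.dst for e in edges}
    alpha = {n: -math.inf for n in nodes}
    beta = {n: -math.inf for n in nodes}
    alpha[start] = 0.0
    beta[end] = 0.0
    # Forward pass: visit edges by increasing source node.
    for e in sorted(edges, key=lambda e: e.src):
        alpha[e.dst] = logsumexp([alpha[e.dst], alpha[e.src] + e.log_score])
    # Backward pass: visit edges by decreasing destination node.
    for e in sorted(edges, key=lambda e: e.dst, reverse=True):
        beta[e.src] = logsumexp([beta[e.src], e.log_score + beta[e.dst]])
    total = alpha[end]  # log-probability mass of all lattice paths
    return [math.exp(alpha[e.src] + e.log_score + beta[e.dst] - total)
            for e in edges]

def keyword_detections(edges, posteriors, keyword, threshold):
    """Sum posteriors of time-overlapping keyword edges, then accept/reject."""
    hits = sorted((e.t_start, e.t_end, p)
                  for e, p in zip(edges, posteriors) if e.word == keyword)
    clusters = []
    for t0, t1, p in hits:
        if clusters and t0 < clusters[-1][1]:  # overlaps previous cluster
            c0, c1, s = clusters[-1]
            clusters[-1] = (c0, max(c1, t1), s + p)
        else:
            clusters.append((t0, t1, p))
    return [(t0, t1, min(s, 1.0), s >= threshold) for t0, t1, s in clusters]

# Toy lattice: "home" appears on two parallel edges with different end times.
lattice = [
    Edge(0, 1, "call", 0.0, 0.4, math.log(0.6)),
    Edge(0, 1, "tall", 0.0, 0.4, math.log(0.4)),
    Edge(1, 2, "home", 0.4, 0.9, math.log(0.5)),
    Edge(1, 3, "home", 0.4, 1.0, math.log(0.3)),
    Edge(1, 3, "hum", 0.4, 1.0, math.log(0.2)),
    Edge(2, 4, "now", 0.9, 1.5, math.log(1.0)),
    Edge(3, 4, "now", 1.0, 1.5, math.log(1.0)),
]
posteriors = edge_posteriors(lattice, start=0, end=4)
detections = keyword_detections(lattice, posteriors, "home", threshold=0.6)
```

In this toy lattice neither "home" edge reaches the threshold on its own, because the posterior mass is split between the two parallel edges; once accumulated, the keyword hypothesis is accepted. This is the dispersion effect that the rescoring techniques are meant to address.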

Publication date: 2007